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Kale IdoScope — Miriingtfre Int&mBt 

This document describes a new concept, Kaleidoscope, which mines the Internet to 
augment data sources with real world infonnatkm. 

Today the Internet is the most up-to-date and complete description of the world .If 

there is something happening, or you are interested in finding a specific piece of 

and moWpeople ate toning intuitively to the Internet The internet 
mw^Ss an incredible array of information, from real-time data such as news, 

and weather to long-term sources of description about the world such 
a? corporate home pages, product infration. geographic ^ ^^f™ 5 
SscriDtions encyclopedias, dictionaries and other resource material. As of late 1999, 
ttSShoSboWl billionpages of information, and mis figure is growmg 
dramatically. 

The world today is focused on the Internet as a medium for connnunkation, 
miction, dSbntion and transaction. Kaleidoscope is a phuosoplncal shifi - 
^SSg Ae Internet as the world's largest and most up-to-date description of the 

world. 



Every customer-oriented company is hungry for mformaUon on its ttistomers- what 
beSr place to search than (he Internet Kaleidoscope works by looking at each 
SXnina company's database, and searching through foe Internet tc .wgmart 
SJavanable on this transaction and its customers. This data augmeul^on adds 
Zi onW^SSned descriptive keywords to each transaction, hut also keywords 
mat work towards describing the key values behind the transaction. 

Kaleidoscope can be used in a number of different ways, each of which wfllbe 
d^SSfofiXr detail. As a segmentation engine. Kaleidoscope can search 
SSaa Tmtemet-«tpanded customer database to find people in many different 

foSmS. those who have just had children who maybe interested* 
investing in pharmaceutical stocks, or who enjoy baseball. 

This document describes the core Internet ^^^^^^^^^ 
some of the general applications that can take advantage of this technology, and some 
of the Industries to which this technology can be applied. 



PAGE 21/60 * RCVD AT SV872005 4:45:52 PM [Eastern Dayfight Time] * SVR:USPTO-ffXRF-6/26 * DN1S:2738300 * CSID:650 474 8401 * DURATION (mm-ss):13.20 



08/08/2005 MON 13:51 FAX 650 474 8401 GLENN- PATENT GROUP 

Disclosure Document Proprietary and Confidential 



Kaleidoscope 



HowitWoiks 

At the highest level, Kaleidoscope works by taWng taction ^ record data torn 

Tlarise database, and searching the Internet for detailed descnpfaons and 

"S^SLofthnt data. The objective is to utilize the Internet as the , taqg .most 
2Ed most up-to-date description of me world. THe descriptive data that . 
SSed fern the web searches is then added to the original datt to prov.de 
S^SaSwitb a richer internal data source for various apphcat.ona. 

of Kaleidoscope is shown below. 




Kaleidoscope Internet 
Analysis Engine 



] 




For each Mdiobe 

augmented. *o 
Kflfe/dbacqpe Engne 

analyzes the Internet 

descriptive data. 



ETe 2Sa for Field #5. Examples of how this system may be applied to real- 
world data environments are listed in a latex section. 
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Smart Caching Technology 

In its direct form, the system shown above will generally be too slow for modern 
larne-scale data systems. Modem transaction processing systems are required to 
Socess in the order of 100 to 1000 transactions per second, makmg external arcess to 
a comparatively slow resource such as the Internet an impractical endeavor. This can 
be easily remedied by an intelligent caching system; 

Data fields presented to Kaleidoscope for analysis will generally have from between 
SSut ToS Snot values up into the millions of distinct values. Below ■about 1000, 
Se d values are most likely to have explicit operational meiumig, arm the added 
value of Kaleidoscope will be less. The cardinality of data sets can be used to 
establish an intelligent caching system. 

in addition to the cardinality issue, most descriptions mat Kaleidoscope createswill 
£ SrSTties that may change week-to-week or month-to-month, but are unhkely to 
to changing minute by minute. As such it will be appropriate to cache the . 
otsSonf for each value of a field, and update them only as reo^. Thaupdate 
can^ccur on a regular basis or upon actively detect^ changes in 
SonSes from which the analysis has been done. In this ■Banner, very h^gh-speed 
SoSnaugmentation can occur, with periodic updates to the cache to keep the 
descriptions current 

The diagram below shows a high level configuration for me caching system. 



Kaleidoscope Internet 
Analysis Engine 



Cache 
Manager 
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Value - Description 
Database . 



Database Access 



s High 

£1 



Original Transaction ^en f «on *ugm^on 
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In the above diagram* the Value-Description Database contains the cached 
descriptions for each field value. When an unknown value is encountered, this results 
in a cache-miss which is detected by the cache manager and passed to the 
Kaleidoscope Engine. The Kaleidoscope Engine executes the various Internet 
searches and other tasks, and creates the description. This is then put into the cache as 
a new value-description record. 

The cache manager also utilizes the Kaleidoscope Engine to update the descriptions 
in the cache as required* This can be triggered on a time schedule, by detecting 
changes in the Internet source from which each given description was created, and/or 
by other external signals. 

Because the database is relatively simple in structure (although potentially large in 
size), high transaction speeds wiD be easily achieved. In this manner this form of, 
intelligent cache has enabled high-speed access while not compromising the quality 
of the core data augmentation process* 
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Example Applications of Kaleidoscope 

This section describes a number of example applications for the Kaleidoscope 
technology. These are a subset of the full range of areas in which Kaleidoscope will 
be able to provide real value. 

Payment Card Transaction Augmentation 

Credit card transactions contain only a rudimentary description of the actual product 
or service that was purchased The key indicator typically used is the Standard 
Industry Code field (often referred to as the "SIC code**) which is a four digit integer 
numeric field. Unfortunately the SIC code is not always accurately mapped; and has a 
wide variation in resolution. Some SIC codes identify individual companies, whilst 
, others collapse an entire industry into a single code. 

The other key descriptive field in the transaction is the merchant name* This is a text 
string which typically contains some form of the merchant's name. This field is 
mainly used for statement generation purposes, and as such is often compressed and 
shortened. From an analytic viewpoint, even accurately knowing the company's 
name is of little value as translating the name into a business or product category is 
difficult 

This is where Kaleidoscope can add unique value. By talcing the merchant name as 
the field to be augmented, Kaleidoscope can go onto the web and create a description 
for that merchant in terms of known English keywords. For example, a transaction at 
«Brecht BMW" would return keywords such as "automobile" and -luxury* \ A 
transaction at "Vista Point Hti M would return "hotel", "holiday", and "resort". The 
Keywords can be extracted based on the type of marketing that is being done to scH 
the product In this way, the reasons people buy services and products can be 
extracted, along with the products themselves. 

These keywords, in appropriate symbolic form, would then be added to the 
transaction record. For the first time, analysts and marketers would be able to search 
for and cluster their Payment card customers based on English descriptions of the 
products and services they purchase. A more detailed description of an analytic 
marketing tool based on Kaleidoscope is shown later. 

There are various domain specific modules required to build Kaleidoscope for the 
Payment card industry. These ate captured in the diagram below: 
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Kaleidoscope Internet Analysis Engine 



URL Locating 
Modules 
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Value - Description 
Database 



The Merchant ID Database bold, the full set of merchant Ids s^xm&e faction.. 
A. ^1 of URL Locating Modules Beans various Internet resources to locate 
^^aSen^URLs for each n*rchant Id. Qncc located^ tr^URLsare^ 
gSeSS^L Analysis Modules, which are ^^SJdSeSSanS 
JSapany and/or service descriptive characteristics from each of the Afierer* Internet 

resources. 

The URL Locating Modules include the following: 

♦ Direct access to company home pages based on a name search. This n»y be 
augmented through a lookup in the RealNames " database and/or company 
listings online. 

• Searches of financial websites for company descriptions. 

« Searchcsofmdustrywcbpa^ 

specific resource sites. There is a large number of local listing pages for each 
city and/or state on the Internet today. 

Each URL module may report multiple URLs that contain apprepriatedescripdons. 
Ttomost suitable (as determined by the various modules) will be selected as bemg 
die definitive set for that merchant or service. 

The URL Analysis Modules takes each of the resources P^viaed by die URL 
Locating Modules and condense them.appropnatcry io construct the final set of 
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Kaleidoscope 

descriptive keywords. These modules are designed around specific Internet resources, 
including the following: 

• Company "about" and "history" pages. 

. Company quarterly / annual reports and letters to shareholder. 

. Financial web-site corporate profiles. 



• Standard Industry Classifications (SIC) code databases. 

• Search engine sxrmrriarized descriptions and categorizations (where 
available). This includes descriptions fiom RealNames . 

fa an offline ermronment, full population keyword analysis is done ^ cn^rrc that the 
keywords being extracted are sufficient to describe the merchants arid service 

Sise the descriptions arc human readable, random 4^-™"°* 
JlSsible to determine the effectiveness of each of the modules described above. 
Because these modules are reasonably Juristic, imrniovernents can be made 
lively by analyzing and comparing the perforrnance of each module on its target 
subset of the data. 

Retail Product Description 

Hetailers will be able to use the same technology to «*egori» ^dnctsfatoamore 
diarized description. Product names can be indexed against the Internet through 
nSn&ctnrer web pages, product review sites, online merchandisers and so on. 

The applications that could utilize this data include general n^thtgand cross-sell 
opportunities, as well as tracking individual customer product preferences at a more 
descriptive level- 
Telecommunications Can Destination Description 

Telephone can destination numbers can be indexed to company nainea and to there 
indexed to descriptions of those companies. This would require an additional 

specific module to convert &>m te dM J**" 
number to . cdrrrpany or service provider name. Personal phone <^toradent.al 
numbers would be utilized in this system to the extent that the system can match 
personal phone numbers to personal web content 

Credit Bureau Augmentation 

There are a small number of fields mat could be augmented in a credit J>?*«^£. 
example, where the employer's name is known or where loan collateral is described 
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on the credit report this information can be indexed against the web to more fully 
describe it This may add useful data for credit approval and general credit 
application processing. 

Collections 

In the collections environment, Kaleidoscope can be used to describe the transaction 
history of the individual at any point in time up to and including the time that they 
went delinquent The additional information would be valuable in determining the 
Specific actions to be taken on each collections case or whether the debt should be 
managed in house or sold to a third party. 

Investment Data 

The securities an individual invests in axe an indication of the type of person they are, 
where their interests He and their propensity to assume risk. Kaleidoscope can extract 
descriptions of companies in the same manner described above and use It to describe 
the individual in question. The recent federal ruling that allows financial institutions 
to offer securities to their account holders would allow us to combine the 
Kaleidoscope investment keyword descriptions with the payment cards described 
above. 

In a large bank that offers cross-industry services, this data may be useful in cross- 
sell opportunities. 

Affinity Program / Cross Sell Targeting 

For a financial institution, airline or other large company that runs and manages an 
affinity program, Kaleidoscope can take advantage of the transaction and account 
data to better target product offers to individuals. Offers can be described in English 
language terms and then used to match against the individuals' keyword descriptions. 

In a large financial institution, financial products could be described by keywords, 
which could then be matched against customer keyword descriptions and used to 
target one-to-one offers. The deployment of the offer could be via the institutions 
Internet web site. 

Because an. individuals Keyword profile is changing in real-time, the opportunity to 
target the offers at the appropriate time for any individual consumer is greatly 
enhanced. 

News and Personalization Products 

Whenever an institution wishes to personalize offerings such as a portal web-site, 
news services, information services and so on, they can use Kaleidoscope to assist in 
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describing the interests and motivations of their customers, to the extent that 
transaction and account data is available for this puipose. 
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Kaleidoscope Powered Applications 

In the previous section, Kaleidoscope was presented as a data augmentation system 
for a diverse range of industries. Of key importance bens is the feet that Kaleidoscope 
is automatically describing the transaction data in terms of well-defined English 
keywords. This is both a powerful and very intuitive form of data to work with. This 
section illustrates this with applications mat can extract operational and analytic 
value fiom mis data in veiy easy to manage ways. 

Segmentation by Basic Search 

A portfolio of accounts can be segmented based on keyword description* in very 
™ich me same way that web pages can be located using a keyword based search 
cnaiie. In a Payment card example, a marketing manager may wish to market a 
nroduct to all people who stay at hotels. In this simple case, they simply enter the 
keyword "hoteF into the search window and a list of individuals appeals who have 
visited hotels: 




5: ysi.Cy'j p^c- 



In the screen above, the subset of data has been defined as simply those accounts that 
match to the keyword "hotel". Options on this include the ability to generalize 
«hoteT to higher-level categories and to make web-page associations between 
keywords and other keywords to enhance the matching. 
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The analysis presented on this page is an example only. The example accounts can be 
rank ordered by degree of match to the keywords. The user can select an account to 
view me transaction history and the description of mis individual. For the purposes of 
marketing, this would be useful for doing a small sampling of the customers and 
determining their suitability to the program being designed. 

An example of the individual account screen is shown below: 



Accmmt lnfoi^tion: 81691234 




In this screen is shown basic account information, the top keywords that describe this 
account, the top discriniinator^ format account (keywords mat are generally 

rare but strong in this individual account), and their transaction history. Each 
transaction is shown with the accompanying descriptive terms for that transaction and 
a reference URL. The purpose of showing the reference URL is to enable the user to 
further explore the individual characteristics of the customers that fall into mis 
particular list Other analysis will be applied later to group the reference URL and 
other information at the list leveL 



Segmentation by Advanced Search 

The lists of customers produced by the basic keyword search will be mterestmg to 
peruse, and possibly to utilize in a marketing campaign. However, as with web-based 
searches, Acre is significant additional value to be obtained from advanced search 
capabiliti 
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For example, these include: 

• Searching by multiple keywords with weightings on different keywords, 

# Searching by multiple keywords and associated keywords. The lists of 
associated keywords can be derived from web-page co-occurrence or other 
sources. 

« Exclusion rules to remove customers with strong affinity in certain keywords 
and categories, 

♦ The ability to enter a set of keywords and, from the list of matching 
customers, derive a second set of keywords that illustrate "like behavior* for 
that list 

* The ability to enter a product description in English language form and use 
this as the set of search terms to build a list 

The diagram below shows some examples of this more advanced search. 




. ! Analyze I K 



111 Business travelers with high cell-phone 
^usage and restaurant usage. Preferably 
luxury shopping and entertainment habits 



ml 



Describe j 



Segmentation by Clustering 
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Segmentation can also be done based on clustering. In this approach, similar 
accounts, transactions and/or merchants can be grouped together based on similarities 
in their underlying characteristics. The key advantage of Kaleidoscope in this realm 
is that there is additional descriptive data on which the clustering can be done. This 
will be especially valuable in situations where typically only numeric data exists. 

There are many segmentation algorithms in existence. Most of these will be able to 
utilize the added data provided by Kaleidoscope. 

Trend Analysis 

Kaleidoscope describes transactions and accounts in real-world terms. While this is 
very valuable, it is also important to look at how accounts, products and segments are 
changing over time. Trend analysis techniques can be applied to the Kaleidoscope 
enhanced descriptions to provide trending information for a variety of data categories 
including: 

• Account level trends of keyword importance over time. 

• Segment level trends by monitoring the importance of keywords within a 
defined segment over time* 

• Segment trends can be defined on an individual product to' look at bow the 
underlying population that uses that product is changing over time, 

• General customer demographic trends can be identified by looking at 
segment trends in segments defined by age, gender, geographic region and so 
on. 

This type of trend analysis can be represented in a number of ways, and some 
examples are shown in the diagram below: 
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Gcodcr: 


Female 


jgj Stals: 
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25 to 30 


|2 Revolving: 
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Cross-sell Analysis 

A product can bo described in terms of its current users and in terms of the desired 
market for the product. Kaleidoscope can facilitate cross-sell opportunities by 
examining the key characteristics of individuals who are currently using the product 
and then finding similar individuals to whom the product would be applicable. 
Bec ause this is based on behavioral keyword matches it gives better control at each 
stage of die cross-sell process. 

A cross-sell process could involve die following steps: 

1. Determine the key characteristics of the target product: Ibis can be done 
either by entering desired keywords for the target pro duct, or by using the 
system to determine the set of most discruninatmg keywords that describe 
the current users of the product (versus the overall population), or a 
combination of both. % . j 

2. Build A Segmentation: Using the new description, build a segmentation of 
individuals who currently do not have the product but most closely match the 
description of die product 

3. Refine the Segmentation: Refine the segmentation against standard criteria 
such as demographic filters (age, state, zip, gender, etc) and against 



IS 



PAGE 34/60 1 RCVD AT 8/8/2005 4:45:52 PM [Eastern Daylight Time] * SVR:U$PTQ-ff XRF-6/26 1 DNIS:2738300 * CSID:650 474 8401 * DURATION (mm-ss):13-20 



08/08/2005 ION 13:55 FAX 650 474 8401 GLENN PATENT GROUP 8035/060 

Disclosure Document Proprietary and Confidential 



Kaleidoscope 



operational characteristics (current balances, profitability, revolving rate, 
etc). 



Modeling Applications 

Kaleidoscope adds valuable data to transaction and account records. This data can be 
used in traditional predictive modeling to build models such as: 

* Risk models (bankruptcy, attrition). 

• Profitability. 

. Fraud (assist in detecting lost and stolen and counterfeit cards). 



• 



Customer service. 



These models can utilize standard numerical and symbolic data, and there are a wide 
^S^eeLiques available to do this. The value of Kaleidoscope m una domam 
is the addition of valuable descriptive data. 
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Additional Analysis 
Advertising Analysis 

The data that Kaleidoscope collects is, to some degrees, a first-order cross-reference 
to the Internet. Higher order analysis is also possible, for example: 

• In analyzing a merchant's website certain keywords may not be well 
explained from (he website alone. Additional related keywords can be 
extracted from other related websites. For example, a BMW service center 
may not describe BMWs veiy welL The keyword BMW may best be 
described from another site or set of sites. This second-order cross-reference 
can he especially valuable to enhance small merchant's web sites where it is 
not clear what the quality or marketing message is behind the products. 

• Additional correlative value can be extracted by characterizing where a 
product is advertised. Luxury goods are more likely to be advertised on 
financial websites, whereas more average priced goods are more likely to be 
advertised on sports or entertainment sites. Spidering die web and looking at 
the relationships between products, where they are advertised and with what 
other products are they advertised will provide useful correlative 
information. This is only possible for Internet advertising where the 
information is in an easy-to-access digital form that can be processed 
auto m atically from a spidering system. 

• Additional data can be extracted by looking at which web-pages link to the 
merchant reference URLs, and which pages they link to. This will identify 
partner programs and other related merchants. 

Additionally, the Kaleidoscope technology could easily be adopted to allow web- 
marketing companies like DoubleClick and AdForce to better understand consumer 
web behavior. Currently, these companies primarily target users by data such as time 
. of day, what state a person lives or works in and perhaps what company they work 
for, if they are in school, and what Web site they are looking at Kaleidoscope would 
add real-world keyword descriptions of the web sites visited by an individual in the 
same manner it did for Payment Card transactions. The result would be a richer 
description of the online consumer. 

Population Description 

Because it works in the domain of English language keywords and expressions, 
Kaleidoscope can provide real-world descriptions of populations and segments of 
populations. This may be of value in determining the key characteristics of different 
product users for the purposes of driving advertising and marketing content In this 
way, Kaleidoscope can actually provide the content creators with descriptive 
information that may help them better understand their audience, and hence better 
design advertising material* 
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Other Data Sources 

In this description of Kaleidoscope, the Internet is described as the key source for 
complete and up-to-date information about the world This is certainly an appealing 
and attractive source of data, and its importance is growing every day. However other 
specifically accessed data sources may provide benefit, such as: 

• Credit bureau data, especially relating to specific loaas granted or applied for. 

• Electronic encyclopaedia, dictionary and thesaurus data. Although this is not at 
all dynamic, it should provide some useful generalization ability to Kaleidoscope, 

• News services not directly available over the Internet 

• Electronic forms of the yellow and white pages, census data and general 
contemporary literature. If, for commercial reasons, this data is not available on 
the Internet, we may be able to purchase it for the building of the Kaleidoscope 
knowledgebase. 

• Internet transactions- We can describe an individual by the keywords associated 
with general Internet browsing. The specific keywords may be different, but we 
can build up a profile based on spidering the websites and pages thai they visit 
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